117 research outputs found
Data-level hybrid strategy selection for disk fault prediction model based on multivariate GAN
Data class imbalance is a common problem in classification problems, where
minority class samples are often more important and more costly to misclassify
in a classification task. Therefore, it is very important to solve the data
class imbalance classification problem. The SMART dataset exhibits an evident
class imbalance, comprising a substantial quantity of healthy samples and a
comparatively limited number of defective samples. This dataset serves as a
reliable indicator of the disc's health status. In this paper, we obtain the
best balanced disk SMART dataset for a specific classification model by mixing
and integrating the data synthesised by multivariate generative adversarial
networks (GAN) to balance the disk SMART dataset at the data level; and combine
it with genetic algorithms to obtain higher disk fault classification
prediction accuracy on a specific classification model
A Survey of Methods for Handling Disk Data Imbalance
Class imbalance exists in many classification problems, and since the data is
designed for accuracy, imbalance in data classes can lead to classification
challenges with a few classes having higher misclassification costs. The
Backblaze dataset, a widely used dataset related to hard discs, has a small
amount of failure data and a large amount of health data, which exhibits a
serious class imbalance. This paper provides a comprehensive overview of
research in the field of imbalanced data classification. The discussion is
organized into three main aspects: data-level methods, algorithmic-level
methods, and hybrid methods. For each type of method, we summarize and analyze
the existing problems, algorithmic ideas, strengths, and weaknesses.
Additionally, the challenges of unbalanced data classification are discussed,
along with strategies to address them. It is convenient for researchers to
choose the appropriate method according to their needs
Concordance between microsatellite instability and mismatch repair protein expression in colorectal cancer and their clinicopathological characteristics: a retrospective analysis of 502 cases
Microsatellite instability (MSI) is one of the hallmarks of colorectal cancer (CRC). Mismatch repair (MMR) protein expression may reflect MSI status. To analyze the concordance between MSI and MMR expression in CRC and their clinicopathological characteristics, 502 CRC patients were retrospectively collected in this study. Polymerase chain reaction-capillary electrophoresis (PCR-CE) was used to measure MSI, and MMR expression was determined by immunohistochemistry (IHC). The causes of non-concordance were analyzed. Chi-square test was used to find the correlation between MSI and various clinicopathological parameters. PCR-CE results showed 64 (12.7%) patients had high microsatellite instability (MSI-H); low microsatellite instability (MSI-L) and microsatellite stable (MSS) cases were 19 (3.8%)and 419 (83.5%), respectively. With regard to IHC, 430 (85.7%) showed proficient mismatch repair (pMMR) and 72 (14.3%) showed deficient mismatch repair (dMMR). The coincidence rate of MSI and MMR expression in CRC was 98.4% (494/502), with good concordance (Kappa = 0.932). Using PCR-CE as the gold standard, the sensitivity, specificity, positive predictive value, and negative predictive value of IHC were 100%, 98.2%, 88.9%, and 100%, respectively. MSI-H was more common in women, right colon, tumors ≥ 5 cm, ulcerative type, mucinous adenocarcinoma, poor differentiation, T stage I/II, and without lymph node or distant metastasis for CRC patients. In summary, MSI exhibited some typical clinicopathological characteristics. MSI and MMR expression in CRC had good concordance. However, it is still extremely necessary to perform PCR-CE. We recommend that testing packages of different sizes should be developed in clinical practice to create a testing echelon, to facilitate comprehensive selection according to experimental conditions, clinical diagnosis, and treatment needs
Recommended from our members
Functional variant of the carboxypeptidase M (CPM) gene may affect silica-related pneumoconiosis susceptibility by its expression: a multistage case-control study.
ObjectivesIn a genome-wide association study, we discovered chromosome 12q15 (defined as rs73329476) as a silica-related pneumoconiosis susceptibility region. However, the causal variants in this region have not yet been reported.MethodsWe systematically screened eight potentially functional single-neucleotide polymorphism (SNPs) in the genes near rs73329476 (carboxypeptidase M (CPM) and cleavage and polyadenylation specific factor 6 (CPSF6)) in a case-control study including 177 cases with silicosis and 204 healthy controls, matched to cases with years of silica dust exposure. We evaluated the associations between these eight SNPs and the development of silicosis. Luciferase reporter gene assays were performed to test the effects of selected SNP on the activity of CPM in the promoter. In addition, a two-stage case-control study was performed to investigate the expression differences of the two genes in peripheral blood leucocytes from a total of 64 cases with silicosis and 64 healthy controls with similar years of silica dust exposure as the cases.ResultsWe found a strong association between the mutant rs12812500 G allele and the susceptibility of silicosis (OR=1.45, 95% CI 1.03 to 2.04, p=0.034), while luciferase reporter gene assays indicated that the mutant G allele of rs12812500 is strongly associated with increased luciferase levels compared with the wild-type C allele (p<0.01). Moreover, the mRNA (peripheral blood leucocytes) expression of the CPM gene was significantly higher in subjects with silicosis compared with healthy controls.ConclusionsThe rs12812500 variant of the CPM gene may increase silicosis susceptibility by affecting the expression of CPM, which may contribute to silicosis susceptibility with biological plausibility
Sonic Hedgehog Pathway Is Essential for Maintenance of Cancer Stem-Like Cells in Human Gastric Cancer
Abnormal activation of the Sonic hedgehog (SHH) pathway has been described in a wide variety of human cancers and in cancer stem cells (CSCs), however, the role of SHH pathway in gastric CSCs has not been reported. In this study, we investigated the possibility that abnormal activation of the SHH pathway maintained the characteristics of gastric CSCs. First, we identified cancer stem-like cells (CSLCs) from human gastric cancer cell lines (HGC-27, MGC-803 and MKN-45) using tumorsphere culture. Compared with adherent cells, the floating tumorsphere cells had more self-renewing capacity and chemoresistance. The cells expressing CSCs markers (CD44, CD24 and CD133) were also significantly more in tumorsphere cells than in adherent cells. More importantly, in vivo xenograft studies showed that tumors could be generated with 2×104 tumorsphere cells, which was 100-fold less than those required for tumors seeding by adherent cells. Next, RT-PCR and Western blot showed that the expression levels of Ptch and Gli1 (SHH pathway target genes) were significantly higher in tumorsphere cells than in adherent cells. The results of quantitative real-time PCR were similar to those of RT-PCR and Western blot. Further analysis revealed that SHH pathway blocked by cyclopamine or 5E1 caused a higher reduction in self-renewing capacity of HGC-27 tumorsphere cells than that of adherent cells. We also found that SHH pathway blocking strongly enhanced the efficacy of chemotherapeutic drugs in HGC-27 tumorsphere cells in vitro and in vivo but had no significant effect in adherent cells. Finally, we isolated the tumorspheres from gastric cancer specimen, these cells also had chemoresistance and tumorigenic capacity, and SHH pathway maintained the gastric CSLCs characteristics of tumorsphere cells from primary tumor samples. In conclusion, our data suggested that SHH pathway was essential for maintenance of CSLCs in human gastric cancer
Surface Sediment Diatom Assemblages Response to Water Environment in Dongping Lake, North China
The relationship between the diatom taxa preserved in surface lake sediments and environmental variables in Dongping Lake was explored using multivariate statistical methods. The statistical analysis showed that the lake was eutrophicated in all seasons. Transparency, chlorophyll a (Chla) and total phosphorus (TP) were the dominant environmental factors in spring and summer, and NH4+-N and chemical oxygen demand (COD) were the dominant environmental factors in autumn and winter. Sixteen genera and 43 species of diatom were found in the surface sediments, and the dominant diatom genera were Aulacoseira, Ulnaria, Cyclotella, Navicula and Fragilaria. A redundancy analysis (RDA) and Monte Carlo permutation 20 test revealed that COD, pH, TP, conductivity and transparency were significant factors influencing diatom assemblage change, meaning that the distribution of the diatom assemblages were mostly influenced by nutrient composition, light intensity and ion concentrations
DDX1 is a prognostic biomarker and correlates with immune infiltrations in hepatocellular carcinoma
Abstract Hepatocellular carcinoma (HCC) is one of the leading lethal malignant tumors worldwide. DEAD-box (DDX) family helicases are implicated in numerous human cancers. However, the role of DDX1 in HCC has not yet been fully elucidated. We downloaded gene expression data and clinical information data of HCC from The Cancer Genome Atlas and International Cancer Genome Consortium (ICGC) database and conducted subsequent analyses using the R package and online portal. The results revealed that HCC tissues had higher DDX1 expression compared with either paired or unpaired normal tissues. The increased DDX1 expression was closely related to the advanced pathological grade and histologic grade of HCC. Further analysis suggested that patients with high DDX1 expression contributed to poor prognosis The Cox regression analysis revealed that the expression level of DDX1 was an independent prognostic factor for HCC. In addition, an ICGC cohort was used for external validation. The cBio-Portal, MethSurv, and UALCAN database were used for evaluating the genomic mechanism. Moreover, the Tumor Immune Estimation Resource dataset and QUANTISEQ algorithm revealed that DDX1 expression positively correlates with immune infiltrating cells. We also identified the DDX1-related differentially expressed genes (DEGs) and explored their biological functions by GO, KEGG, and GSEA analyses, which indicated that DDX1 may regulate the progression of HCC. In general, increased DDX1 expression predicts a poor prognosis and drives the progression of HCC
Ecological Wisdom and Inheritance Thinking of the Traditional Village’s Water Resources Management in Taihang Mountains
In the context of China’s rapid urbanization, the landscape texture and ecosystem of traditional villages are being constructively damaged, especially the water ecosystem responsible for maintaining the sustainable development of villages. It is urgent to explore the green construction technology of traditional villages. However, some traditional villages in Taihang Mountain area exist and develop in the environment of drought and flood, which contains rich ecological water resources management wisdom. Therefore, in this paper, Taihang Mountain is selected as the research area to collect data through literature analysis, field research, and in-depth interviews. This study uses ArcGIS spatial analysis and statistical functions to analyze the acquired data and explore the water resources management practices of traditional villages based on three aspects: safety, function, and spirit. Then it summarizes four key inspirations for contemporary water resources management: “adapt to local conditions, pay equal attention to water use and prevention”; “division of labor and cooperation complement each other”, “low cost, low technology, low maintenance” and “make the best use of materials and compound functions”. These advantageous insights can be offered for the preservation of traditional villages, the enhancement of the living environment, and the management and growth of urban stormwater
- …